Channel: DeepMind
Category: Science & Technology
Description: Research Scientist Hado van Hasselt covers prediction algorithms for policy improvement, leading to algorithms that can learn good behaviour policies from sampled experience. Slides: dpmd.ai/modelfreecontrol Full video lecture series: dpmd.ai/DeepMindxUCL21